ADDITIVE COMBINATORICS, CUBIC FORMS ON $\mathbb{F}_2^n$ AND
HURWITZ SQUARE IDENTITIES

SOPHIE MORIER-GENOUD AND VALENTIN OVSIENKO 

Abstract. We apply the Hurwitz-Radon theory of square identities to additive combinatorics
of sumsets in $\mathbb{F}_2^n$. We fix an arbitrary cubic function $\alpha$ on $\mathbb{F}_2^n$ and obtain information about the
size and structure of a set $A \subset \mathbb{F}_2^n$ satisfying $\alpha|_{(A+A)\setminus\{0\}} = 1$. We then consider two different
sets, $A$ and $B$, and obtain a lower bound for the size of the sumset $A + B$ under a similar
condition for the additive quadruples. Our main tool is the non-associative algebra associated
to a cubic function on $\mathbb{F}_2^n$.

1. Introduction

In this paper, we investigate the relationship between two different subjects: additive
combinatorics of sumsets in $\mathbb{F}_2^n$ and the theory of "square identities" (or "composition of quadratic
forms"), initiated by Hurwitz [14]. Starting from two sets, $A, B \subset \mathbb{F}_2^n$, and assuming some
restriction on additive quadruples in $A \times A \times B \times B$, we construct a square identity. The main
ingredient of our construction is a cubic binary (or boolean) form. This construction allows us
to establish several statements about sizes of sumsets, as well as statements about binary cubics.

We will need to fix the notation and recall several standard notions of combinatorics over $\mathbb{F}_2$.

• An element $x \in \mathbb{F}_2^n$ is represented as an $n$-tuple of 0 and 1: $x = (x_1, \ldots, x_n)$. The
Hamming weight $\mathrm{wt}(x)$ is the number of non-zero components $x_i$.

• A cubic form on $\mathbb{F}_2^n$ is a function $\alpha : \mathbb{F}_2^n \to \mathbb{F}_2$ of the form

\[ \alpha(x) = \sum_{1 \le i \le j \le k \le n} \alpha_{ijk}\, x_i x_j x_k, \]

where the coefficients $\alpha_{ijk} \in \{0, 1\}$. Note that, over $\mathbb{F}_2$, we have $x_i^2 = x_i$ and therefore
every polynomial of degree at most 3 can be viewed as a homogeneous cubic form.

• The cardinality of a set $A \subset \mathbb{F}_2^n$ is denoted by $|A|$.

• Given two sets $A, B \subset \mathbb{F}_2^n$, the sumset $A + B$ is the set of elements of the form $a + b$ with
$a \in A$, $b \in B$.
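The notions above are easy to experiment with on a computer. The following is a minimal sketch, assuming that a vector of $\mathbb{F}_2^n$ is stored as a Python integer bitmask (bit $i$ holds $x_{i+1}$), so that addition is bitwise XOR. The helper names `wt`, `sumset` and `cubic_form` are illustrative and do not come from the paper.

```python
def wt(x: int) -> int:
    """Hamming weight of a vector of F_2^n stored as a bitmask."""
    return bin(x).count("1")


def sumset(A, B):
    """Sumset A + B; addition in F_2^n is bitwise XOR."""
    return {a ^ b for a in A for b in B}


def cubic_form(triples):
    """Build alpha(x) = sum of x_i x_j x_k (mod 2) over the given index triples.

    Each triple (i, j, k) uses 0-based indices with i <= j <= k.
    A repeated index encodes a monomial of lower degree, because x_i^2 = x_i.
    """
    masks = [(1 << i) | (1 << j) | (1 << k) for i, j, k in triples]

    def alpha(x: int) -> int:
        # A monomial equals 1 exactly when all of its variables are 1.
        return sum(1 for m in masks if x & m == m) % 2

    return alpha


# Example: alpha(x) = x_1 x_2 x_3 + x_1 (indices shifted to 0-based).
alpha = cubic_form([(0, 1, 2), (0, 0, 0)])
print(wt(0b1011), sorted(sumset({1, 2}, {1, 4})), alpha(0b001))
```

The bitmask encoding is only a convenience; any representation of $n$-tuples over $\mathbb{F}_2$ would do.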

We also recall two central notions of the Hurwitz theory. 

• A square identity of size $[r, s, N]$ is an identity of the form

\[ (a_1^2 + \cdots + a_r^2)(b_1^2 + \cdots + b_s^2) = c_1^2 + \cdots + c_N^2, \]

where the $c_i$ are bilinear expressions in the $a_j$ and $b_k$ with coefficients in $\mathbb{Z}$. In [14], Hurwitz
formulated his famous problem: determine all the triples $(r, s, N)$ such that there exists
an identity of size $[r, s, N]$. The values $(r, s, N)$ are optimal if $r$ and $s$ cannot be increased
and $N$ cannot be decreased.

• The Hurwitz-Radon function $\rho$ is a function on the set of natural numbers, $\rho : \mathbb{N} \to \mathbb{N}$.
If $N = 2^n (2m + 1)$, then $\rho(N) = \rho(2^n)$ (i.e., it depends only on the dyadic part of $N$),
and the latter number is given by

\[ \rho(2^n) = \begin{cases} 2n + 1, & n \equiv 0 \pmod 4, \\ 2n, & n \equiv 1, 2 \pmod 4, \\ 2n + 2, & n \equiv 3 \pmod 4. \end{cases} \]

• The celebrated Hurwitz-Radon theorem [15, 19] (see also [21]) is formulated as follows:
there exists an identity of size $[r, N, N]$ if and only if $r \le \rho(N)$. This is the only
case where the Hurwitz problem is solved. The importance of the Hurwitz problem is due
to various applications of square identities and their direct relations to many areas of
number theory, algebra, geometry and topology.
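For numerical experiments it is convenient to have the Hurwitz-Radon function available. The sketch below uses the values of $\rho(2^n)$ quoted in Section 5 ($2n$ for $n \equiv 1, 2$, $2n + 2$ for $n \equiv 3$ and $2n + 1$ for $n \equiv 0 \pmod 4$); the function name `hurwitz_radon` is an illustrative choice.

```python
def hurwitz_radon(N: int) -> int:
    """Hurwitz-Radon function rho(N) for a positive integer N."""
    if N < 1:
        raise ValueError("N must be a positive integer")
    # Extract the dyadic part: N = 2^n * (odd number).
    n = 0
    while N % 2 == 0:
        N //= 2
        n += 1
    r = n % 4
    if r == 0:
        return 2 * n + 1
    if r == 3:
        return 2 * n + 2
    return 2 * n  # n = 1 or 2 (mod 4)


print([hurwitz_radon(2 ** n) for n in range(9)])
```

In particular $\rho(N) = N$ only for $N = 1, 2, 4, 8$, which matches the classical identities for real numbers, complex numbers, quaternions and octonions.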

Starting from two sets $A$, $B$ and their sumset $A + B$, and assuming some conditions in terms
of (an arbitrary) cubic form $\alpha$, we produce a square identity of size $[|A|, |B|, |A + B|]$. This
approach was developed in [17, 16, 18] in order to construct new square identities. In this paper,
we apply the same technique in the converse direction: we obtain information about additive
combinatorics of sumsets from known results on square identities.

In Section 2 we formulate our results about sumsets. Sections 3-5 contain all our constructions:
the correspondence between cubic functions on $\mathbb{F}_2^n$ and real non-associative algebras, the
correspondence between sumsets and square identities, and explicit examples of maximal subsets for
Hurwitz-Radon identities. Section 6 completes the proofs.

2. The main results 

2.1. An upper bound. The classical Freiman theorem [7] and its analog over $\mathbb{F}_2$ due to Ruzsa,
as well as various generalizations (see [20], [6], [9] and [8] for a survey of the whole subject) provide
information about the structure of a set $A$ under a restriction on the cardinality of $A + A$. Our
first result gives information about the cardinality of $A$ under a restriction on the structure
of $A + A$.

Theorem 1. Given a cubic function $\alpha : \mathbb{F}_2^n \to \mathbb{F}_2$ and a set $A \subset \mathbb{F}_2^n$, if for every $x, x' \in A$ such
that $x \ne x'$ one has $\alpha(x + x') = 1$, then

\[ |A| \le \rho(2^n). \]

This bound is sharp, at least in the cases $n \equiv 1, 2$ or $3 \pmod 4$.

We will give examples of a cubic form $\alpha$ and sets $A$ satisfying the condition $\alpha(x + x') = 1$
and the equality $|A| = \rho(2^n)$ (except for the case $n \equiv 0 \pmod 4$, where we have no examples).
An immediate combinatorial consequence of the above theorem is the following statement.

Corollary 2.1. If for every $x, x' \in A$ such that $x \ne x'$, the weight $\mathrm{wt}(x + x')$ is not a multiple
of 4, then $|A| \le \rho(2^n)$.

Note that if "multiple of 4" is replaced by "multiple of" another integer, say 3 or 5, then
the maximal cardinality $|A|$ is at least quadratic in $n$. The value 4 is the only value for which
the upper bound is linear in $n$.
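For very small $n$ the bound of Corollary 2.1 can be compared with an exhaustive search. The following sketch is only an illustration (the function name `max_admissible_size` and the clique-search strategy are assumptions of this example, not part of the paper): it looks for the largest set in which no pairwise sum has weight divisible by 4.

```python
def wt(x: int) -> int:
    """Hamming weight of a bitmask."""
    return bin(x).count("1")


def max_admissible_size(n: int) -> int:
    """Largest |A| in F_2^n with wt(x + x') not divisible by 4 for distinct x, x'.

    Plain branch-and-bound clique search; intended for n <= 4 only.
    """
    best = 0

    def extend(size, candidates):
        nonlocal best
        if size > best:
            best = size
        for idx, v in enumerate(candidates):
            # Prune: even taking every remaining candidate cannot beat `best`.
            if size + len(candidates) - idx <= best:
                break
            rest = [u for u in candidates[idx + 1:] if wt(u ^ v) % 4 != 0]
            extend(size + 1, rest)

    extend(0, list(range(1 << n)))
    return best


print([max_admissible_size(n) for n in (1, 2, 3, 4)])
```

For $n = 1, 2, 3$ the search returns $2, 4, 8$, which equals $\rho(2^n)$. For $n = 4$ it returns 8, below $\rho(16) = 9$: the only excluded sum is the all-ones vector, so the admissible pairs form a complete graph minus a perfect matching.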

The following statement is a refinement of Theorem 1.

Corollary 2.2. If for every $x \ne x' \in A$ one has $\alpha(x + x') = 1$, and $A \subset V$, where $V$ is an affine
subspace of $\mathbb{F}_2^n$, then $|A| \le \rho(|V|)$.

As above, we can replace the assumption $\alpha(x + x') = 1$ by the more restrictive one that $\mathrm{wt}(x + x')$
is not a multiple of 4.

Let us also emphasize that Theorem 1 implies that the set $A$ cannot have a small doubling,
namely $|A + A| \ge c\,|A|^2$, where $c$ is a constant (asymptotically j).

2.2. Additive quadruples. Our second statement concerns so-called additive quadruples. If
$A, B \subset \mathbb{F}_2^n$, four elements $x, x' \in A$, $y, y' \in B$ form an additive quadruple $(x, x', y, y')$ if

\[ x + x' + y + y' = 0. \]

We call an additive quadruple proper if $x \ne x'$ and $y \ne y'$.

Theorem 2. Let $A, B \subset \mathbb{F}_2^n$ with $|A| \le |B|$. If every proper additive quadruple $(x, x', y, y')$
satisfies $\alpha(x + x') = 1$, then

\[ |A + B| \ge \Omega\bigl(|A|^{6/5}\bigr). \]

As above, the condition $\alpha(x + x') = 1$ can be replaced by the condition that the weight
$\mathrm{wt}(x + x')$ is not a multiple of 4.

The Balog-Szemerédi-Gowers theorem [3, 11], in the $\mathbb{F}_2^n$ case (see [9]), states, roughly speaking,
that the sumset $A + B$ grows slowly, provided there are "many" additive quadruples (of
order $|A|^3$). The above result is a sort of converse statement.

2.3. A few properties of cubic forms. The classification of boolean cubic forms on $\mathbb{F}_2^n$, modulo
the action of the linear group $\mathrm{GL}(n, 2)$, is unknown for $n > 9$; see [12], [3] and the website
http://langevin.univ-tln.fr/project/. The above theorems provide invariants of cubic forms,
which are of course not sufficient for their classification. Given a cubic form $\alpha$, the maximal
cardinality of a set $A \subset \mathbb{F}_2^n$ such that $\alpha(x + x') = 1$ for all $x \ne x' \in A$ is obviously an invariant
of $\alpha$. The maximal cardinality of $|A + B|$, where $A$, $B$ are as in Theorem 2, is also an invariant.

We also prove the following simple property of cubic binary forms.

Proposition 2.3. If $V \subset \mathbb{F}_2^n$ is an (affine) subspace such that $\alpha(v) = 1$ for all non-zero $v \in V$,
then $\dim(V) \le 3$.

2.4. The method. Hurwitz's problem remains wide open and no explicit formulas or asymptotics
for the optimal triples $(r, s, N)$ are known in general. However, the problem is old and
much information is available; see [21].

Our method is based on the algebraic constructions developed in [17, 16, 18]. For every cubic
form $\alpha$ on $\mathbb{F}_2^n$, we construct a real non-associative algebra with basis $\{e_x \mid x \in \mathbb{F}_2^n\}$ and the
product encoded by $\alpha$. The square identities are realized in the form $\|a\|\,\|b\| = \|ab\|$, provided
$a$ and $b$ are chosen in "good" subspaces of the algebra, where $\|\cdot\|$ is the Euclidean norm.

The following statement, already proved in [16], will be explained in the sequel.

Theorem 3. Given subsets $A, B \subset \mathbb{F}_2^n$ and a cubic form $\alpha$, if for every proper additive quadruple
$(x, x', y, y')$ one has $\alpha(x + x') = 1$, then there exists a square identity of size given by the
cardinalities: $[|A|, |B|, |A + B|]$.

Note that this statement is a refinement of Yuzvinsky's theorem [23] (see also [21], Theorem 13.A.1,
p. 286). Yuzvinsky did not consider cubic forms and his result gives a lower bound
for $|A + B|$ in terms of the Hopf-Stiefel function.
3. Cubic functions and twisted group algebras

In this section, we develop our technique and establish the relationship between cubic forms
and non-associative algebras.

3.1. Examples of cubic functions on $\mathbb{F}_2^n$. The space of (boolean) functions on $\mathbb{F}_2^n$ with values
in $\mathbb{F}_2$ is isomorphic to the quotient of the space of polynomials with coefficients in $\mathbb{F}_2$ by
the ideal generated by the relations $x_i^2 = x_i$, namely

\[ \mathbb{F}_2[x_1, \ldots, x_n] \,/\, (x_i^2 - x_i : i = 1, \ldots, n). \]

Every boolean function can be expressed as a polynomial in the variables $(x_1, \ldots, x_n)$ with
coefficients in $\mathbb{F}_2$, but not in a unique way. We will be interested in polynomials of degree $\le 3$.

Example 3.1. Let us introduce a very special cubic function $\alpha_{\mathbb{O}}$ on $\mathbb{F}_2^n$: it is equal to 1
everywhere except at the vectors whose weight $\mathrm{wt}(x)$ is a multiple of 4:

\[ \alpha_{\mathbb{O}}(x) = \begin{cases} 0, & \text{if } \mathrm{wt}(x) \equiv 0 \pmod 4, \\ 1, & \text{otherwise.} \end{cases} \]

The explicit coordinate formula can be written as follows:

\[ \alpha_{\mathbb{O}}(x) = \sum_{1 \le i < j < k \le n} x_i x_j x_k + \sum_{1 \le i < j \le n} x_i x_j + \sum_{1 \le i \le n} x_i, \tag{3.1} \]

that is, $\alpha_{\mathbb{O}}$ is just the total sum of the monomials of degree 1, 2 and 3. Since $x_i = x_i^2 = x_i^3$, this
function can be viewed as a homogeneous cubic form.

The form $\alpha_{\mathbb{O}}$ is special since it is invariant with respect to the action of the group $\mathfrak{S}_n$ of
permutations of the coordinates. Functions of this type are sometimes called counting functions.

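Example 3.2. Another $\mathfrak{S}_n$-invariant cubic form is

\[ \sum_{1 \le i \le j \le k \le n} x_i x_j x_k, \]

which vanishes for all $x$ except those with $\mathrm{wt}(x) = 4m + 1$.
The quadratic form

\[ q(x) = \sum_{1 \le i < j \le n} x_i x_j + \sum_{1 \le i \le n} x_i \]

is equal to 1 if $\mathrm{wt}(x) = 4m + 1$ or $4m + 2$.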

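Let us give the comparative table of values of the above forms.

(3.2)
    wt(x) (mod 4)                          0    1    2    3
    $\alpha_{\mathbb{O}}(x)$                        0    1    1    1
    $\sum_{i \le j \le k} x_i x_j x_k$     0    1    0    0
    $q(x)$                                 0    1    1    0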
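The values of the three forms can be recomputed directly from their coordinate formulas. The sketch below is illustrative: the function names are hypothetical, and the index range $i \le j \le k$ for the cubic form of Example 3.2 is an assumption, chosen because it is the reading under which the form equals 1 exactly for $\mathrm{wt}(x) = 4m + 1$.

```python
from itertools import combinations, combinations_with_replacement


def bits(x, n):
    """Coordinates (x_1, ..., x_n) of a bitmask x."""
    return [(x >> i) & 1 for i in range(n)]


def alpha_O(x, n):
    """Formula (3.1): all monomials of degree 1, 2 and 3 with distinct indices."""
    v = bits(x, n)
    total = sum(v)
    total += sum(v[i] * v[j] for i, j in combinations(range(n), 2))
    total += sum(v[i] * v[j] * v[k] for i, j, k in combinations(range(n), 3))
    return total % 2


def cubic_32(x, n):
    """Cubic form of Example 3.2, summed over i <= j <= k (assumed index range)."""
    v = bits(x, n)
    triples = combinations_with_replacement(range(n), 3)
    return sum(v[i] * v[j] * v[k] for i, j, k in triples) % 2


def q(x, n):
    """Quadratic form of Example 3.2."""
    v = bits(x, n)
    return (sum(v[i] * v[j] for i, j in combinations(range(n), 2)) + sum(v)) % 2


def value_table(n):
    """Map wt(x) mod 4 to the set of observed triples (alpha_O, cubic_32, q)."""
    table = {}
    for x in range(1 << n):
        key = sum(bits(x, n)) % 4
        table.setdefault(key, set()).add((alpha_O(x, n), cubic_32(x, n), q(x, n)))
    return table


print(value_table(6))
```

Each residue class of the weight carries a single triple of values, which confirms that all three forms depend only on $\mathrm{wt}(x)$ modulo 4.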
The above forms, especially $\alpha_{\mathbb{O}}$, will be useful to construct the non-associative algebras that play
an essential role in this work.



3.2. Twisted group algebras. We now recall the definition of a twisted group algebra over
an abelian group. We restrict ourselves to the case of the group $\mathbb{F}_2^n$, and the ground field is $\mathbb{R}$;
we refer to [5] for the general theory.

Definition 3.3. Let $f : \mathbb{F}_2^n \times \mathbb{F}_2^n \to \mathbb{F}_2$ be an arbitrary function of two variables. The twisted
group algebra over $\mathbb{F}_2^n$ associated to $f$ is the real $2^n$-dimensional algebra with basis $\{e_x \mid x \in \mathbb{F}_2^n\}$
and the product given by

\[ e_x \cdot e_{x'} = (-1)^{f(x, x')}\, e_{x + x'}. \]

This algebra is denoted by $(\mathbb{R}[\mathbb{F}_2^n], f)$.

This algebra is, in general, neither commutative nor associative. The non-commutativity is
measured by the function

\[ \beta(x, y) := f(x, y) + f(y, x), \]

while the non-associativity is measured by the function

\[ \delta f(x, y, z) := f(y, z) + f(x + y, z) + f(x, y + z) + f(x, y). \]
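A twisted group algebra is straightforward to model. In the sketch below an element is a dictionary from bitmasks to coefficients, and the twist `f_quat` on $\mathbb{F}_2^2$ is an illustrative choice (the bilinear function $\sum_{i \le j} x_i y_j$), which should reproduce the multiplication table of the quaternions. All names here are assumptions of the example.

```python
def twisted_product(a, b, f):
    """Product in (R[F_2^n], f), where e_x * e_y = (-1)^f(x, y) e_{x+y}.

    a and b map bitmasks x to real coefficients a_x.
    """
    out = {}
    for x, ax in a.items():
        for y, by in b.items():
            sign = -1 if f(x, y) else 1
            out[x ^ y] = out.get(x ^ y, 0) + sign * ax * by
    return {z: c for z, c in out.items() if c != 0}


def beta(f, x, y):
    """Measures non-commutativity: e_x e_y = (-1)^beta e_y e_x."""
    return (f(x, y) + f(y, x)) % 2


def delta_f(f, x, y, z):
    """Measures non-associativity of the triple (e_x, e_y, e_z)."""
    return (f(y, z) + f(x ^ y, z) + f(x, y ^ z) + f(x, y)) % 2


def f_quat(x, y):
    """Illustrative twist on F_2^2: x_1 y_1 + x_1 y_2 + x_2 y_2 (mod 2)."""
    x1, x2 = x & 1, (x >> 1) & 1
    y1, y2 = y & 1, (y >> 1) & 1
    return (x1 * y1 + x1 * y2 + x2 * y2) % 2


i, j = {0b01: 1}, {0b10: 1}
print(twisted_product(i, i, f_quat), twisted_product(i, j, f_quat),
      twisted_product(j, i, f_quat))
```

With this twist the two generators square to $-1$ and anticommute, and $\delta f$ vanishes identically because `f_quat` is bilinear, so the algebra is associative.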

Many classical algebras, such as the algebras of quaternions $\mathbb{H}$, of octonions $\mathbb{O}$, and, more
generally, the Clifford algebras and the Cayley-Dickson algebras, can be realized as twisted
group algebras over $\mathbb{F}_2^n$; see [1, 2].

The associativity condition, $\delta f = 0$, is too restrictive and does not lead to algebras interesting
for combinatorics. For instance, the Clifford algebras are too simple for our purpose. On the
other hand, algebras with $\delta f \ne 0$ can be very hard to handle; this is the case for the
Cayley-Dickson algebras. In [17] we introduced an intermediate condition: the symmetrization $\beta$
of $f$ is a 2-cocycle, i.e.,

\[ \delta\beta = 0. \]

This leads to a series of new algebras. We proved in [17] that, if $\beta$ as above is a 2-cocycle,
then it is necessarily a coboundary, i.e., $\beta = \delta\alpha$, and $\alpha$ has to be a cubic form on $\mathbb{F}_2^n$. Moreover,
$\alpha$ determines the twisted group algebra up to isomorphism.
We summarize the approach as follows.

3.3. From cubic forms to algebras. There is a canonical way to construct a twisted group
algebra out of an arbitrary cubic form $\alpha$.

Proposition 3.4. Given a cubic form $\alpha$, there exists a function $f$ satisfying the conditions:

(a) First polarization formula:

\[ f(x, y) + f(y, x) = \alpha(x + y) + \alpha(x) + \alpha(y). \]

(b) Second polarization formula:

\[ f(x, y) + f(x, y + z) + f(x + y, z) + f(y, z) = \alpha(x + y + z) + \alpha(x + y) + \alpha(x + z) + \alpha(y + z) + \alpha(x) + \alpha(y) + \alpha(z). \]

(c) Linearity of $f$ in the second variable:

\[ f(x, y + y') = f(x, y) + f(x, y'). \]

(d) Reconstruction of $\alpha$ from $f$:

\[ f(x, x) = \alpha(x). \]
Proof. The existence of $f$ follows from an explicit formula. We replace every monomial in $\alpha$
according to the following rule:

\[ \begin{aligned} x_i x_j x_k &\longmapsto x_i x_j y_k + x_i y_j x_k + y_i x_j x_k, \\ x_i x_j &\longmapsto x_i y_j, \\ x_i &\longmapsto x_i y_i, \end{aligned} \tag{3.3} \]

where $i < j < k$, and obtain in this way a function $f$ of two arguments satisfying the properties
(a)-(d). □
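The four properties can be checked by brute force for small $n$. The sketch below is an illustration under stated assumptions: a cubic form is stored as a set of monomials, each an increasing tuple of 0-based indices of length 1, 2 or 3, and $f$ is built from it by the substitution rule of the proof. The names `make_alpha_and_f` and `check_proposition` are hypothetical.

```python
import random
from itertools import combinations


def make_alpha_and_f(monomials):
    """Return (alpha, f), with f obtained from alpha by the rule in the proof."""

    def alpha(x):
        return sum(all((x >> i) & 1 for i in m) for m in monomials) % 2

    def f(x, y):
        total = 0
        for m in monomials:
            xb = [(x >> i) & 1 for i in m]
            yb = [(y >> i) & 1 for i in m]
            if len(m) == 3:
                total += xb[0] * xb[1] * yb[2] + xb[0] * yb[1] * xb[2] + yb[0] * xb[1] * xb[2]
            elif len(m) == 2:
                total += xb[0] * yb[1]
            else:
                total += xb[0] * yb[0]
        return total % 2

    return alpha, f


def check_proposition(n, monomials):
    """Exhaustively test properties (a)-(d) on F_2^n."""
    alpha, f = make_alpha_and_f(monomials)
    pts = range(1 << n)
    for x in pts:
        if f(x, x) != alpha(x):  # (d)
            return False
        for y in pts:
            if (f(x, y) + f(y, x)) % 2 != (alpha(x ^ y) + alpha(x) + alpha(y)) % 2:  # (a)
                return False
            for z in pts:
                if f(x, y ^ z) != (f(x, y) + f(x, z)) % 2:  # (c)
                    return False
                lhs = (f(x, y) + f(x, y ^ z) + f(x ^ y, z) + f(y, z)) % 2
                rhs = (alpha(x ^ y ^ z) + alpha(x ^ y) + alpha(x ^ z) + alpha(y ^ z)
                       + alpha(x) + alpha(y) + alpha(z)) % 2
                if lhs != rhs:  # (b)
                    return False
    return True


n = 4
all_monomials = {m for d in (1, 2, 3) for m in combinations(range(n), d)}
print(check_proposition(n, all_monomials))
```

Taking all monomials of degree 1, 2 and 3, as in the last lines, corresponds to the form of formula (3.1); any other subset of monomials can be passed in the same way.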

Example 3.5. Our main examples are related to the octonions and Clifford algebras.

(1) The function $f_{\mathbb{O}} : \mathbb{F}_2^n \times \mathbb{F}_2^n \to \mathbb{F}_2$ corresponding to the form $\alpha_{\mathbb{O}}$ is as follows:

\[ f_{\mathbb{O}}(x, y) = \sum_{1 \le i < j < k \le n} (x_i x_j y_k + x_i y_j x_k + y_i x_j x_k) + \sum_{1 \le i \le j \le n} x_i y_j \]

for all $x = (x_1, \ldots, x_n)$ and $y = (y_1, \ldots, y_n)$, elements of $\mathbb{F}_2^n$. When $n = 3$, the
corresponding twisted group algebra is isomorphic to the classical algebra $\mathbb{O}$ of octonions.

(2) The twisted group algebra $(\mathbb{R}[\mathbb{F}_2^n], f_q)$, where $f_q$ is the function corresponding to the
quadratic form $q$ from Example 3.2, is isomorphic to the Clifford algebra $\mathrm{Cl}_{0,n}$.

The constructed algebras $(\mathbb{R}[\mathbb{F}_2^n], f_{\mathbb{O}})$ generalize the algebra of octonions (and the Clifford
algebras). These algebras were introduced and studied in [17]. Their properties are completely
different from those of Cayley-Dickson algebras.

Remark 3.6. Given a cubic form $\alpha$, the choice of a function $f$ is unique modulo a coboundary.
More precisely, two functions of two variables, $f$ and $f'$, correspond to the same cubic form $\alpha$
if and only if $f + f'$ is a coboundary. The corresponding twisted group algebras are isomorphic.

3.4. Why cubic functions? One cannot choose a polynomial of degree $\ge 4$, instead of a cubic
function, in order to construct a twisting function $f$ satisfying properties (a)-(d) of
Proposition 3.4. Indeed, let us apply the differential $\delta$ to the equation in property (b). Since $\delta^2 = 0$,
one obtains after a short computation:

\[ \begin{aligned} 0 = {} & \alpha(x + y + z + t) \\ & + \alpha(x + y + z) + \alpha(x + y + t) + \alpha(x + z + t) + \alpha(y + z + t) \\ & + \alpha(x + y) + \alpha(x + z) + \alpha(x + t) + \alpha(y + z) + \alpha(y + t) + \alpha(z + t) \\ & + \alpha(x) + \alpha(y) + \alpha(z) + \alpha(t). \end{aligned} \]

This is exactly the condition that $\alpha$ is a polynomial of degree at most 3.
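The vanishing condition above is easy to test numerically. The following sketch (with hypothetical helper names `fourth_difference` and `monomial`) sums $\alpha$ over all non-empty subsums of four vectors, and compares a cubic monomial with a quartic one.

```python
from itertools import combinations


def fourth_difference(alpha, x, y, z, t):
    """Sum of alpha over all non-empty subsums of {x, y, z, t}, modulo 2."""
    total = 0
    for r in range(1, 5):
        for subset in combinations((x, y, z, t), r):
            s = 0
            for v in subset:
                s ^= v  # addition in F_2^n
            total += alpha(s)
    return total % 2


def monomial(indices):
    """Boolean monomial: product of the coordinates with the given 0-based indices."""
    mask = sum(1 << i for i in indices)
    return lambda x: int(x & mask == mask)


cubic = monomial((0, 1, 2))
quartic = monomial((0, 1, 2, 3))
print(fourth_difference(cubic, 1, 2, 4, 3), fourth_difference(quartic, 1, 2, 4, 8))
```

For the quartic monomial and the four basis vectors, the only subsum on which the monomial equals 1 is the full sum, so the expression equals 1 and the condition fails.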

3.5. A criterion of existence of $\alpha$. An equivalent way to express the condition $\delta\beta = 0$ (and
therefore the existence of $\alpha$) is to require that the ternary function $\delta f(x, y, z)$ be symmetric in
its arguments $x, y, z$. This means that the non-associativity is not "too wild". In particular, the
algebras are alternative.



4. From algebra to square identities 

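4.1. The Euclidean norm. Consider a twisted group algebra $(\mathbb{R}[\mathbb{F}_2^n], f)$. Our plan is to define
the Euclidean norm (or absolute value) on the space $\mathbb{R}[\mathbb{F}_2^n] \cong \mathbb{R}^{2^n}$ and investigate its
compatibility with the algebra structure.

Every element of the algebra is a linear combination of the basis elements,

\[ a = \sum_{x \in \mathbb{F}_2^n} a_x\, e_x, \]

with (arbitrary) real coefficients $a_x \in \mathbb{R}$. We set

\[ \|a\|^2 := \sum_{x \in \mathbb{F}_2^n} a_x^2. \]

Consider two sets $A, B \subset \mathbb{F}_2^n$ and the coordinate subspaces $\mathcal{A}$ and $\mathcal{B} \subset (\mathbb{R}[\mathbb{F}_2^n], f)$:

\[ \mathcal{A} = \Bigl\{ a \;\Big|\; a = \sum_{x \in A} a_x\, e_x \Bigr\} \quad \text{and} \quad \mathcal{B} = \Bigl\{ b \;\Big|\; b = \sum_{y \in B} b_y\, e_y \Bigr\}. \]

Our next task is to determine a necessary and sufficient condition on $A$ and $B$ that guarantees

\[ \|a\|^2\, \|b\|^2 = \|ab\|^2 \tag{4.1} \]

for all $a \in \mathcal{A}$ and $b \in \mathcal{B}$. Observe that equation (4.1) is nothing but a square identity of size

\[ [\,|A|, |B|, |A + B|\,]. \]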

4.2. The normed subspaces. This idea goes back to Yuzvinsky [23]; it was also used in [17, 16, 18].
The following statement was obtained in all these references; we give here a proof for
the sake of completeness.

Lemma 4.1. The condition (4.1) is satisfied if and only if for all $x \ne x' \in A$ and $y \ne y' \in B$
such that $x + x' + y + y' = 0$, one has:

\[ f(x, y) + f(x, y') + f(x', y) + f(x', y') = 1. \tag{4.2} \]

Proof. The product of the norms on the left-hand side of (4.1) is:

\[ \|a\|^2\, \|b\|^2 = \sum_{x, y} a_x^2\, b_y^2. \]

On the other hand, the product of two elements is given by

\[ a \cdot b = \sum_{x, y} (-1)^{f(x, y)} a_x b_y\, e_{x + y} = \sum_{z} \Bigl( \sum_{x + y = z} (-1)^{f(x, y)} a_x b_y \Bigr) e_z. \]

The Euclidean norm of this element is

\[ \|a \cdot b\|^2 = \sum_{x, y} a_x^2\, b_y^2 + \sum_{\substack{x + y = x' + y' \\ x \ne x'}} (-1)^{f(x, y) + f(x', y')} a_x b_y a_{x'} b_{y'}. \]

The monomial $a_x b_y a_{x'} b_{y'}$ in the second summand appears twice, and has total coefficient

\[ (-1)^{f(x, y) + f(x', y')} + (-1)^{f(x, y') + f(x', y)}. \]

This coefficient vanishes if and only if the condition (4.2) holds. □
4.3. The norm condition in terms of the cubic. Suppose now that the algebra $(\mathbb{R}[\mathbb{F}_2^n], f)$
was constructed out of a cubic form $\alpha$; see Section 3.3. Equation (4.2) then drastically
simplifies.

Lemma 4.2. Equation (4.2) reads: $\alpha(x + x') = 1$.

Proof. Use the linearity of $f$ in the second argument and substitute $y' = x + x' + y$ into the
left-hand side of (4.2). After cancellation one has

\[ f(x, x) + f(x, x') + f(x', x) + f(x', x') = \alpha(x) + \beta(x, x') + \alpha(x') = \alpha(x + x'), \]

thanks to properties (a) and (d) of Proposition 3.4. □
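Lemmas 4.1 and 4.2 can be tested numerically. The sketch below is an illustration with hypothetical names (`f_O`, `product`, `norm2`, `is_multiplicative`): it implements the twisting function attached to the form (3.1) and compares $\|a\|^2 \|b\|^2$ with $\|ab\|^2$ on random integer vectors, so that the arithmetic is exact. A random test of this kind can only refute multiplicativity, not prove it.

```python
import random
from itertools import combinations


def f_O(x, y, n):
    """Twisting function for formula (3.1), obtained by the rule of Proposition 3.4."""
    xb = [(x >> i) & 1 for i in range(n)]
    yb = [(y >> i) & 1 for i in range(n)]
    total = sum(xb[i] * xb[j] * yb[k] + xb[i] * yb[j] * xb[k] + yb[i] * xb[j] * xb[k]
                for i, j, k in combinations(range(n), 3))
    total += sum(xb[i] * yb[j] for i in range(n) for j in range(i, n))
    return total % 2


def product(a, b, n):
    """Product of two elements given as dicts {bitmask: coefficient}."""
    out = {}
    for x, ax in a.items():
        for y, by in b.items():
            sign = -1 if f_O(x, y, n) else 1
            out[x ^ y] = out.get(x ^ y, 0) + sign * ax * by
    return out


def norm2(a):
    """Squared Euclidean norm."""
    return sum(c * c for c in a.values())


def is_multiplicative(A, B, n, trials=20, seed=0):
    """Random exact test of ||a||^2 ||b||^2 = ||ab||^2 on the coordinate subspaces."""
    rng = random.Random(seed)
    for _ in range(trials):
        a = {x: rng.randint(-9, 9) for x in A}
        b = {y: rng.randint(-9, 9) for y in B}
        if norm2(a) * norm2(b) != norm2(product(a, b, n)):
            return False
    return True


print(is_multiplicative(range(8), range(8), 3))
```

For $n = 3$ every non-zero vector has weight at most 3, so the whole 8-dimensional algebra passes the test, in agreement with its identification with the octonions. For $n = 4$ the element $a = e_0 + e_\omega$, with $\omega = (1111)$, gives $\|a \cdot a\|^2 = 8$ while $\|a\|^4 = 4$, because the form (3.1) vanishes at $\omega$.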

5. Construction of Hurwitzian sets 

In this section, we fix the cubic form $\alpha_{\mathbb{O}}$ given by (3.1) and study the sets $A \subset \mathbb{F}_2^n$ satisfying
the condition $\alpha_{\mathbb{O}}(x + x') = 1$ for all distinct $x, x' \in A$. In other words, the weight $\mathrm{wt}(x + x')$
is not a multiple of 4. We are interested in the sets $A$ of cardinality $|A| = \rho(2^n)$. Such sets
were already considered in [16], where they were called Hurwitzian sets. In this section, we
give explicit constructions of Hurwitzian sets. In particular, we discuss a relation to the binary
Hadamard matrices.

5.1. Cases $n \equiv 1, 2 \pmod 4$. In this case, $\rho(2^n) = 2n$. The following choice of a Hurwitzian
set is perhaps the most obvious:

\[ A = \{0,\ e_1, e_2, \ldots, e_n,\ e_1 + e_2, e_1 + e_3, \ldots, e_1 + e_n\}. \]

For all $x, x' \in A$, the weight of the sum satisfies $\mathrm{wt}(x + x') \le 3$, and thus $\alpha_{\mathbb{O}}(x + x') = 1$,
provided $x + x' \ne 0$. Therefore $A$ is a Hurwitzian set.
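The set above is simple to generate and check. A minimal sketch, with vectors stored as bitmasks and with illustrative function names:

```python
def wt(x: int) -> int:
    """Hamming weight of a bitmask."""
    return bin(x).count("1")


def basic_hurwitzian_set(n: int):
    """The set {0, e_1, ..., e_n, e_1 + e_2, ..., e_1 + e_n} as a list of bitmasks."""
    e = [1 << i for i in range(n)]
    return [0] + e + [e[0] ^ e[i] for i in range(1, n)]


def no_sum_weight_divisible_by_4(A) -> bool:
    """True if wt(x + x') is not a multiple of 4 for all distinct x, x' in A."""
    return all(wt(x ^ y) % 4 != 0 for i, x in enumerate(A) for y in A[i + 1:])


A = basic_hurwitzian_set(5)
print(len(A), no_sum_weight_divisible_by_4(A))
```

The set has $2n$ elements for every $n$, and the weight condition holds for every $n$ as well; it is only for $n \equiv 1, 2 \pmod 4$ that $2n$ equals $\rho(2^n)$.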

Note that the above choice is not unique. However, it is easy to see that the set $A$ is the only
Hurwitzian set which is a "shift-minimal downset" according to the terminology of [10].



5.2. Case $n \equiv 3 \pmod 4$. In this case, $\rho(2^n) = 2n + 2$, which is the most interesting situation
for many reasons.

Consider the element of maximal weight:

\[ \omega = (1 1 \ldots 1) = e_1 + \cdots + e_n. \]

One can choose the above set $A$, completed by $\omega$ and $e_1 + \omega$. Let us give a more symmetric
example.

Choose the set

\[ A = \{0,\ \omega,\ e_1, e_2, \ldots, e_n,\ e_1 + \omega, e_2 + \omega, \ldots, e_n + \omega\}. \]

The weight of a non-zero element of the sumset $A + A$ can be one of the following five values:
$1$, $2$, $n - 2$, $n - 1$, or $n$. Since none of these is a multiple of 4, we conclude that $A$ is a Hurwitzian set.
Moreover, it is not difficult to show that the above set is the only Hurwitzian set invariant with
respect to the group of permutations $\mathfrak{S}_n$.
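The weights occurring in $A + A$ for the symmetric set can be listed explicitly. In the sketch below (function names are illustrative), the weights found are $1$, $2$, $n - 2$, $n - 1$ and $n$, the last one coming from $\omega$ itself.

```python
def wt(x: int) -> int:
    """Hamming weight of a bitmask."""
    return bin(x).count("1")


def symmetric_set(n: int):
    """The set {0, omega, e_i, e_i + omega : i = 1..n} as a list of bitmasks."""
    omega = (1 << n) - 1
    e = [1 << i for i in range(n)]
    return [0, omega] + e + [v ^ omega for v in e]


def sum_weights(A):
    """Set of weights wt(x + x') over all pairs of distinct elements of A."""
    return {wt(x ^ y) for x in A for y in A if x != y}


A = symmetric_set(7)
print(len(set(A)), sorted(sum_weights(A)))
```

For $n = 7$ this gives 16 elements, which equals $\rho(2^7)$, and none of the weights is a multiple of 4.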



5.3. Another choice in the case $n \equiv 3 \pmod 8$, relation to the Hadamard matrices.

The case $n \equiv 3 \pmod 8$ is a subcase of the above one. Remarkably, there is a choice of
Hurwitzian set based on the classical Hadamard matrices.

Recall that a Hadamard matrix is an $m \times m$ matrix $H$ with entries $\pm 1$, such that ${}^t\!H H = m\,\mathcal{I}$,
where ${}^t\!H$ is the transpose of $H$ and $\mathcal{I}$ is the identity matrix. It is known that a Hadamard
matrix can exist only if $m = 1, 2$ or $m = 4s$; existence for arbitrary $s$ is the classical Hadamard
conjecture.

The construction is as follows. We remove the first column (which we may assume to be constant,
after changing the signs of some rows) and then consider two $4s \times (4s - 1)$ matrices
($12 \times 11$ when $s = 3$), $H_1$ and $H_2$, with entries $0, 1$. The matrix $H_1$ is obtained by replacing $1$ by $0$ and $-1$ by $1$,
the matrix $H_2$ is obtained by replacing $-1$ by $0$.

Lemma 5.1. The rows of $H_1$ and $H_2$ form a Hurwitzian set in $\mathbb{F}_2^{4s-1}$, provided $s$ is odd.

Proof. It follows from the definition of a Hadamard matrix that every sum of two distinct rows
of $H_1$ is of weight $2s$, and similarly for $H_2$. The sum of a row of $H_1$ with a row of $H_2$ is of weight
$2s - 1$ or $4s - 1$. □
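The lemma can be illustrated for $s = 3$. The sketch below makes two assumptions that are not in the text: the Hadamard matrix is produced by a Paley-type construction for the prime $q = 11$, and its first column is normalized to $+1$ before being removed, which is what makes the weights come out exactly as in the proof. All function names are hypothetical.

```python
def paley_hadamard(q: int):
    """Paley-type Hadamard matrix of order q + 1, for a prime q = 3 (mod 4)."""

    def chi(a):
        # Quadratic character modulo q, computed with Euler's criterion.
        a %= q
        if a == 0:
            return 0
        return 1 if pow(a, (q - 1) // 2, q) == 1 else -1

    m = q + 1
    H = [[0] * m for _ in range(m)]
    for i in range(m):
        for j in range(m):
            if i == j or i == 0:
                H[i][j] = 1
            elif j == 0:
                H[i][j] = -1
            else:
                H[i][j] = chi(i - j)
    return H


def is_hadamard(H) -> bool:
    """Check that the rows are pairwise orthogonal with squared length m."""
    m = len(H)
    return all(sum(H[i][k] * H[j][k] for k in range(m)) == (m if i == j else 0)
               for i in range(m) for j in range(m))


def hadamard_rows(H):
    """Rows of H_1 and H_2 as bitmasks, after normalizing and dropping column 1."""
    rows = [[v * r[0] for v in r][1:] for r in H]  # first column becomes +1
    H1 = [sum(1 << k for k, v in enumerate(r) if v == -1) for r in rows]
    H2 = [sum(1 << k for k, v in enumerate(r) if v == 1) for r in rows]
    return H1 + H2


A = hadamard_rows(paley_hadamard(11))
weights = {bin(x ^ y).count("1") for x in A for y in A if x != y}
print(len(set(A)), sorted(weights))
```

The 24 rows are distinct vectors of $\mathbb{F}_2^{11}$, and the pairwise sums have weights 5, 6 and 11, that is, $2s - 1$, $2s$ and $4s - 1$ for $s = 3$.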

Example 5.2. The (unique up to equivalence) $12 \times 12$ Hadamard matrix $H$ corresponds to two
$12 \times 11$ binary matrices $H_1$ and $H_2$ (which are related to the extended Golay code). The rows of
the matrices $H_1$ and $H_2$ constitute a Hurwitzian set in $\mathbb{F}_2^{11}$ of cardinality 24.

5.4. Case $n \equiv 0 \pmod 4$. Recall that $\rho(2^n) = 2n + 1$ in this case. We have no construction of
a Hurwitzian set. Moreover, we are convinced that a similar situation holds for any cubic form.

Conjecture 1. Given a boolean cubic function $\alpha$ on $\mathbb{F}_2^n$ with $n \equiv 0 \pmod 4$, there is no set $A$
such that $\alpha|_{(A+A)\setminus\{0\}} = 1$ and $|A| = 2n + 1$.


6. Proof of the main results 
We are ready to complete the proofs of the results formulated in Section 2.

6.1. Proof of Theorem 1 and Corollaries 2.1 and 2.2. Fix an arbitrary cubic form $\alpha$, and
let $A \subset \mathbb{F}_2^n$ be a set such that

\[ \alpha|_{(A+A)\setminus\{0\}} = 1. \]

Lemmas 4.1 and 4.2 then imply $\|a\|\,\|b\| = \|ab\|$ for all $a \in \mathcal{A}$ and arbitrary $b$ in the algebra
(that is, $B = \mathbb{F}_2^n$). We therefore obtain a square identity of size $[|A|, 2^n, 2^n]$. The Hurwitz-Radon
theorem implies that $|A| \le \rho(2^n)$. This bound is sharp as follows from the constructions of
Hurwitzian sets; see Section 5. Theorem 1 is proved.

Choosing $\alpha = \alpha_{\mathbb{O}}$ as in formula (3.1), one obtains Corollary 2.1.

Now we deduce Corollary 2.2. It suffices to consider a linear subspace $V$, since a translation
of $A$ does not change the sums $x + x'$. Let $A \subset V$; the
linear subspace $V \subset \mathbb{F}_2^n$ is itself isomorphic to $\mathbb{F}_2^m$ for some $m \le n$. Given a cubic form $\alpha$ on $\mathbb{F}_2^n$,
let $\alpha_V$ be its pull-back to $V$. One then has $\alpha_V(x + y) = 1$ for all distinct $x, y \in A$, and concludes
by the same arguments as above. Corollary 2.2 then follows.

6.2. Proof of Theorem 2. Fix, as above, an arbitrary cubic form $\alpha$. Suppose that $A$ and $B$ are
two subsets of the same cardinality $|A| = |B| = r$, such that $\alpha(x + x') = 1$ for all proper additive
quadruples $(x, x', y, y')$. By Theorem 3, one obtains an identity of size $[r, r, N]$, where $N = |A + B|$. The
Hurwitz problem is still open in this particular case and even the asymptotics of the least value
$N_{\min}$ as a function of $r$ are not known exactly. However, it is known that asymptotically

\[ C_1\, r^{6/5} \le N_{\min}(r) \le C_2\, \frac{r^2}{\log(r)}, \]

where $C_1$ and $C_2$ are some constants. The upper bound follows easily from the Hurwitz-Radon
theorem, and the lower bound was recently obtained in [13]. This is precisely the statement of
Theorem 2.

6.3. Proof of Proposition 2.3. If $V \subset \mathbb{F}_2^n$ is a subspace such that $\alpha(v) = 1$ for all non-zero
$v \in V$, then there exists a square identity of size $[|V|, |V|, |V|]$. The famous theorem of
Hurwitz [14] states that there is an identity of size $[N, N, N]$ if and only if $N = 1, 2, 4$ or $8$. It
follows that $\dim(V) \le 3$.